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Abstract 

This paper is about the structural equation modelling of quantitative mea- 
sures that are obtained from a multiple facet design. A facet is simply a set 
consisting of a finite number of elements. It is assumed that measures are 
obtained by combining each element of each facet. Methods and traits are 
two such facets, and a multitrait-multimethod study is a two-facet design. 
We extend models that were proposed for multitrait-multimethod data by 
Wothke (1984; 1996) and Browne (1984, 1989, 1993), and demonstrate how 
they can be fitted using standard software for structural equation modelling. 
Each model is derived from the model for individual measurements in order 
to clarify the first principles underlying each model. 


Introduction 

A Multi-Trait Multi-Method (MTMM) study is characterized by mea- 
sures that are composed as combinations of traits and methods. In this paper, 
we will treat a more general case where measures are composed as combi- 
nations of elements of facets. A facet is simply a set consisting of a finite 
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number of elements, usually called conditions. Facets refer to properties of 
the measures or measurement conditions. Methods and traits are two such 
facets, and a MTMM study is a (fully crossed) two-facet design. Facets need 
not be methods or traits or anything in particular. Consider, for example, a 
study that is presented by Browne (1970) and discussed in detail by Joreskog 
and Sorbom (1996, section 6.3). Persons were seated in a darkened room 
and required to place a rod in vertical position by pushing buttons. The score 
was the (positive or negative) angle of the rod from the vertical. Each person 
had to perform the task twice in a number of different situations which where 
constructed according to a two-facet design. The two facets were the posi- 
tion of the chair and the initial position of the rod, each with three conditions. 
The occasion of the experiment may be considered the third facet with two 
conditions. 

We assume that measures were constructed for each combination of the 
facets and that we have data for each measure. We further assume that mea- 
sures are continuous. We believe that the case of discrete data is more appro- 
priately handled using item response theory models (e.g., Bechger, Verhelst 
& Verstralen, 2001). 

To analyze data from a multiple facet design, we extend two models 
that were suggested for the MTMM design: the covariance component model 
(Wothke, 1984, 1996), and the composite direct product model (Browne, 
1984, 1989, 1993). In doing so, we pursue e.g., Bagozzi, Yi and Nassen, 
(1999), Cudeck (1988), or Browne and Strydom (1997) who suggest general- 
ization of the composite direct product model to multiple facets. Our objective 
is to demonstrate how researchers who know the basic principles of structured 
equation modelling (SEM) may formulate and fit these models using the LIS- 
REL (Joreskog & Sorbom, 1996) or the Mx program 1 (Neale, Boker, Xie, & 
Maes, 2002). There are several alternative software packages but the major- 
ity of these have an interface that is similar to that of LISREL or Mx. For a 
general introduction to SEM we refer the reader to Bollen (1989). 

'The Mx program is free-ware and can, at present, be obtained from the internet address 
http://www.vcu.edu/mx/index.html 
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Each model is derived from the (data) model for individual measure- 
ments in order to clarify the first principles underlying each model. In the con- 
text of MTMM studies, the model for the observations is of less interest since 
the main objective is to establish a structure for the correlations that relates 
to the Campbell and Fiske (1959) criteria for convergent and discriminant va- 
lidity. However, in general multi-facet studies the data model is important as 
a substantive hypothesis that guides the interpretation of the parameters. We 
demonstrate how each of the models can be fitted to a correlation or covari- 
ance matrix. (We refer to Cudeck (1989) for a survey of the issues concerning 
the analysis of correlation matrices). As an illustration, we discuss a number 
of applications to real data. 


Preliminaries 

Persons (or, more general, objects) are assumed to be drawn at random 
from a large population and each observation is taken to be a realization of 
a random vector x of measurements made under combinations of conditions 
of multiple facets. All models that are considered here are based upon the 
following linear model for the observations 

X = p + 7] + U , 

where p, denote the mean of x, and the latent variable 77 represents true or 
common scores. The components of u represent measurement error and are 
assumed to be uncorrelated with mean zero and variance matrix D, t . The 
common scores are uncorrelated to u; they have zero mean and covariance 
matrix S, 7 . It follows that the data have mean vector p. and covariance matrix 

S = + D u , 

where D a is a diagonal matrix. 

We use the following notation: In a design with multiple facets, A, B, 
C, D, etc. denote the facets. Each facet has several conditions (or elements) 
denoted by A*, B y , etc. The number of conditions in each facet is denoted 
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by the lowercase of the letter that is used to denote the facet. For example, 
a = 3 if facet A has three conditions. We use F to denote a generic facet. The 
number of measures that can be constructed (e.g., a x b x c if there are three 
facets) will be denoted by p. The number of facets will be denoted by #F. The 
sum of the conditions in each of the facets (e.g., a + b + c) will be denoted 
by #f. The symbol I 0 denotes an identity matrix of dimension a and l a a 
unit vector with a elements. The symbol ® denotes the Kronecker or direct 
product operator with A® B = (a^B) . Finally, the vector z F denotes a vector 
of random variables associated with a facet; that is, z F = (z(Fi ), . . ., z(Ff)) T , 
where z(Fj) denotes a random variable associated with the j-th condition of 
facet F. Uppercase T refers to transposition. 

The Covariance Component Model 

Wothke (1984) suggested that the Covariance Component (CC) model 
described by Bock and Bargmann (1966) be used for MTMM data. In this 
section, we discuss a number of parameterizations of the CC model and 
demonstrate how each is specified within the LISREL framework. Note that 
the CC model is related to random effects analysis of variance (see Bock & 
Bargmann, 1966, pp. 508-509) but we will not explicitly use this relationship 
in our presentation of the model. 

Introduction 

Let rj x denote a generic element of 77 ; i.e., x is a measurement obtained 
as the combination of Bj, . . . , E r . In the CC model, 77 is assumed to have 
an additive structure. Specifically, 

Vx — 9 + z(Ai) + z(Bj ) + . . . + z(E r ) , 

where g denotes a within-person mean. In matrix notation: 

77 = Az , 

where z = ( g , z^, z^, . . . , z^) T , and A is a p x (1 + #/) incidence matrix; 
that is, a matrix whose entries are zero or one. The rows of A indicate all 
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combinations of conditions of each of the facets. For two to five facets, the 
structure of A is given in Equation 1. 


[1 p I a 8 1 b la 8 h] 

[1 p I a ® 1 be la 8 Ife 8 1 c lab 8 I C ] (1) 

[Ip la *8* 1 bed 1 a 8 lb 8 1 cd 1 ab 8 Ic 8 lrf labc *8* Ic] 

[lp la 8> 1 berfe la 8 lb 8 lerfe lab 8 Ic 8 lrfe labc 8 Irf 8 le ^-abed 8 le] 

For example, if there are two facets, with two conditions each: 


110 10 
110 0 1 
10 110 
10 10 1 


( 2 ) 


Note that Equation 1 was derived assuming that each subsequent facet is 
nested in the preceding facet(s); e.g., AiBiCi, A\B\C 2 , AiB 2 Ci, etc. It is 
easy to see the general pattern in Equation 1 and derive expressions for more 
than five facets. 

It is assumed that z is multivariate normally distributed with zero mean 
and covariance matrix £ z . Furthermore, each facet is assumed to have an 
independent influence on the measurements so that S, is block-diagonal; that 

is, 


= diag (cr|, • • • , S E ) = 


a~ 


'B 


( 3 ) 


where £ f denotes a within-facet dispersion matrix. It follows that 


£ = A£,A r + D, 


( 4 ) 


This is a confirmatory factor analysis (CFA) model with a constant factor 
loading matrix (see Bollen, 1989, chapter 7). As it stands, the model is not 
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identifiable. Informally, this means that no amount of data will help to deter- 
mine the true value of one or more of the parameters. We will demonstrate 
this by constructing an equivalent model with less parameters. 

The matrix A has 1 + #/ columns and rank equal to r(A) = 1 + #/ — 
# F (e.g., Equation 2). Since A has deficient column rank, the vector of ran- 
dom components that satisfies 77 = Az need not be unique. Consider an ex- 

T 

ample with two facets with two conditions each. Then, if zi = (1, 2, 3, 4, 5) 
and z 2 = (7, 0, 1, 0, 1) T , 77 = Az x = Az 2 . 

If z is a solution to 77 = Az we may write any other solution z* as 

g + z(Ai) + z(Bi) 2v\ + n 2 + tq 
0 v 2 

z* = z(A 2 ) - z{Ai) + 2v 3 - v 2 (5) 

0 V4 

z{B 2 ) - z(Bi) 2v 5 - v 4 

where v\ to v 5 are arbitrary constants (e.g., Pringle & Rainer, 1971, p. 10). 
The second and fourth elements of z* are arbitrary which means that the cor- 
responding entries of S 2 are arbitrary and therefore not identifiable. Specif- 
ically, if T, z * denotes the covariance matrix of z*, it is easily checked that 
AXh*A r equals A'E z A r , where X 2 has seven parameters and E 2 * five (see 
Equation 8). 

The first vector in (5) contains linear combinations £ = Lz of the ran- 
dom components that are common to all solutions. In general, L denotes a 
r(A) x (1 + 7 (ff) matrix of full row rank. In the example 2 , 


g + z(Ai) + z(B\) 


"l 

1 

0 

1 

0 " 

z(A 2 ) — z(Ai) 

= Lz = 

0 

-1 1 

0 

0 

1 

s 

Tf 

1 

CN 


0 

0 

0 

-1 Ij 


We will not give a general expression for L but note that, in general, the first 
linear combination £1 is the common score of the first measurement. The 
other linear combinations are within-facet deviations from the first condition. 

2 It is easily checked that Lzi = Lz 2 = (7, 1, 1). 
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We now wish to find an equivalent expression of 77 in terms of That 
is, we look for a matrix A such that Az = A£. In our example, 


Az 


110 10 
110 0 1 
10 110 
10 10 1 


1 

9 


z(Ai) 


z{A 2 ) 


z (Bi) 




g + z(Af) + z(B 1 ) 
g + z(Af) + z{B 2 ) 
g + z(A 2 ) + z{Bf) 
g + z(A 2 ) + z(B 2 ) 


'1 0 o' 



10 1 


g + z(Afj + z(Bi) 

110 


z(A 2 ) — z(A x) 

1 1 1_ 


z(B 2 )^z(B 1 ) 


A^ 


It is seen that A is equal to A with columns corresponding to the first con- 
dition in each facet deleted. In general, A has the same structure as A in 
Equation 1 except that the first column in each of the identity matrices in (1) 


0 


/-1 If-i 


is deleted; that is. If is replaced by 
It follows that 

£ = AS, A t + D, 


(6) 


where £, = L£L 7 denotes the dispersion matrix of This CFA model 
is easily fitted with LISREL or Mx. Note that S^ does not inherit the block- 
diagonal structure of £, because £1 = g + z{Af) + • • • + z(E 1 ) correlates 
to each of the within-facet deviations. However, if S 2 has the postulated 
block-diagonal structure, the within-facet deviations should be uncorrelated 
between facets. 


Alternative Parameterizations I 

The matrix A was constructed by deleting columns from A. Hence, A 
may be written as the matrix product AP, where P is an incidence matrix 
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that serves to delete columns from A. It follows that we may express a model 
that is equivalent to (6) as: 


E = A'E z *A r + D, t 


(7) 


where S,* = PEcP 7 . In the example, 


E 


Z* 


E [g] 

0 

£[6,6] 

0 

£[6,6] 

0 

0 

0 

0 

0 

£[6,6] 

0 

£ [6 2 ] 

0 

0 

0 

0 

0 

0 

0 

E [6,6] 

0 

0 

0 

£ [® 


( 8 ) 


Instead of fitting the model (6), we may fit a model of the form (7) with ap- 
propriate restrictions on S 2 *. In general, these restrictions are that all entries 
in E 2 * involving the first condition of any of the facets are fixed to zero, as in 
( 8 ). 


Alternative Parameterizations II 

Let T denote an arbitrary, non-singular matrix and define A* = AT _1 , 
and E^* = TS/T 7 . It is not difficult to see that any model that can be written 
as: 

E = A*E 5 *A* t + B u (9) 

is equivalent to (6). The matrix E^» denotes the dispersion matrix of 

6 = = TLz = L*z . 

Hence, each alternative model of the form (9) implies a set of linear combi- 
nations defined by L* = TL. 
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Browne (1989) considers linear combinations of the form: 


c = L*z = 


9 + \ + z(A 2 )) + l + z(B 2 )) 

= | ( 2 (^ 2 ) - 2 (^ 1 )) 

The corresponding matrix T, is found be solving L = TL. Here, 


L* = 0 


1 1 

2 2 

_ 1 1 

2 2 


0 -i 1 

u 22 . 


0-11 0 0 
0 0 0 -1 1 


The corresponding A* is 

fl -1 -1 


= AT^ 1 = 


For later reference, we call this parametrization Browne ’s parametrization. In 
general, if we use Browne’s parametrization, A* has the same structure as A 
in Equation 1, except that each of the identity matrices lj in (1) is replaced 

r 1 t 

by the matrix — l/-i If_i . There is, of course, an infinite number of 
alternative parameterizations, each corresponding to a non-singular matrix T 
and a matrix L* (see e.g., Bock & Bargmann, 1966, table 6). 

Browne (1989) implicitly assumes that sub-matrices E^ in (3) have 
equal diagonal elements. This implies that Eg* = L*E-I/ T is block- 
diagonal. For instance, 

£K,£ 2 *] = E 1 -(z(A l ) + z(Af), l -(z(Af^z(Af) 

= 7 ~ ^(Ai)) » 



262 


T.M. Bechger and G. Maris 


which is zero if and only if crjA) = (J ‘l(A 2 y 

Fitting the Covariance Component model to a Correlation Matrix 

To fit the model to a correlation matrix we need to derive the model 
for the correlation matrix P = D,XD,, where D x = diag~ ^ (£); a diagonal 
matrix with on the diagonal the inverses of the population standard-deviations 
of the observed measures. That is, 

P = D x (S, ? + D n ) D x = B x (AS C A 7 + Y) x . (10) 

This model is easily fitted with LISREL. One may, as in (10), use our first 
parametrization and specify: ny = ne = p, nk= r (A), PSI= 0, LAMB DA- Y 
= D x , GAMMA = A, and PHI = XL. An example of a LISREL script is 
provided in the Appendix. Only small changes are necessary to specify 
as in (7) or (9). Note that (10) is a special case of the scale-free covariance 
model proposed by Wiley, Schmidt, and Bramble (1973). 

There is a caveat however. Wothke (1988; 1996) claims that the vari- 
ance of 0 and covariances involving () are not identifiable. Hence, in general, 
model (10) is unsuited for correlation matrices. As mentioned before, Browne 
(1989) assumes that the within-facet dispersion matrices have equal diagonal 
elements. Suppose that this assumption holds. Then, if Browne’s parametriza- 
tion is used, and the matrix in (10) is specified as A*S^*A* r , the matrix 
is block-diagonal and the model may be used for correlation matrices 
provided the variance of is known. Browne (1989) sets the variance of 
to one. This means that all variances and covariance must be interpreted 
relative to the variance of £* . 

Applications of the Covariance Component 
Model 


To a Covariance Matrix 

Lor illustration we apply the CC model to three-facet data gathered by 
Hilton, Beaton, and Bower (1971). The data consist of 2163 measurements of 
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two traits (facet A), measured with two instruments (facet B) on three occa- 
sions (facet C). Details can be found in Cudeck (1988). We use the covariance 
matrix reported in table 4 by Cudeck (1988) to fit the CC model. 

Cudeck notes that the data show large kurtosis and goodness-of-fit 
statistics that are based upon the assumption that the data are normally dis- 
tributed may not be trusted. Following Cudeck, we provide the normal-theory 
generalized least-squares estimates of the parameters in 

92.54 

6.78 11.94 

14.22 11.38 20.02 

-3.35 -1.22 -1.56 8.23 

-1.44 0.22 1.93 -5.41 44.04 

As judged from the standardized residuals the CC model did not fit the data. 
The CC model has an adjusted goodness-of-fit index (AGFI) of 0.91 and a 
standardized root-mean-square residual (RMS) of 0.31. Hence, no substan- 
tive conclusions should be drawn from these parameter estimates and we 
merely present them to enable readers to check their results. Note that Cudeck 
(1988) fitted the multiplicative model that is discussed in the next section to 
the same covariance matrix and found that it fitted the data well. 

To a Correlation Matrix 

Guilford’s (1967) structure of intelligence model can be conceived of 
as a facet design for constructing intelligence tests (Fiske, 1971, p. 128). 
Guilford distinguished three facets. The operation facet refers to the subject’s 
intellectual processing of information; the content facet refers to the content 
of the information; and the product facet to the form of the information. Using 
Guilford’s facet design, Hoeks (1985) constructed eighteen tests measuring 
the content element semantic abilities. He selected two out of five elements 
of the operation facet: cognition and memory, and three out of five elements of 
the product facet: units, systems and transformation of information. Hence, 
the tests where constructed according to a 2 x 3 facet design. For each of 
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the combinations, Hoeks constructed three different tests. The tests may be 
considered as a third facet and we treat the measures as arising from a 2 x 3 x 3 
facet design. The data were analyzed earlier using a standard confirmatory 
factor model by Hoeks, Mellenbergh and Molenaar (1989). We fit the CC 
model to the correlation matrix that they give in their report (see Appendix). 

Following Hoeks, Mellenbergh, and Molenaar (1989) we used un- 
weighted least- squares to fit model (10) to the correlation matrix, assuming 
a block-diagonal structure for Eg*. The CC model reproduced the observed 
correlation matrix well as judged from the residuals. Deviations correspond- 
ing to the third facet (different tests) showed zero variation relative to the 
combination of the first condition in each facet. Thus, we specified a model 
for two facets with three measures for each combination and found a model 
that fitted equally satisfactorily. This model is easily specified by deleting 
the columns of the third facet from the A matrix. Finally, we found that we 
could specify Eg. as a diagonal matrix without visible deterioration of the fit. 
The final model has an AGFI of 0.99, and a RMS of 0.051, comparable to the 
values found by Hoeks, et al. (1989). Other fit indices are not reported be- 
cause they require normal distributions. A LISREL script for the final model 
is in the Appendix. Our analysis suggests that the facet-structure suggested 
by Guilford does indeed hold. 

The Composite Direct Product Model 

The multiplicative data model may be derived from the following mul- 
tiplicative structure for the common score rj: 

T] x = z(Aj)z(B k ) ■ ■ ■ z(E r ) . (11) 

In contrast to the CC model, it is now assumed that the true score is the 
product of a set of latent variables. It is clear that (11) represents a strong 
hypothesis; one that will not often be deemed realistic in social science ap- 
plications. As illustrated in Figure 1 for two facets, different values of z(Ai) 
give the same true score in combination with two different values of z(Bj). 
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Figure 1. A surface plot of q x for different values of z(A t ) and z(Bj) 


Furthermore, it is seen that there is a unique point where the true score is zero. 
Needles to say that, although the latent variables in the CC and in the com- 
posite direct product (CDP) model are represented by the same symbol, their 
interpretation is quite different. In matrix notation, the data model in (1 1) is 

r] = z A <8> z B <S> z c <8> • • • <S> z E ■ (12) 


As with Equation 1, Equation 12 is derived assuming that each subsequent 
facet is nested in the preceding facet(s). 

The dispersion matrix of the common scores is found by expanding 
E [r]r] T ] which gives 


— E 


(z A ® zb 
= E[(z a ®z b < 


Z E ) ( Z A <g) Z B 


Ze) 


■ -®z e ) t 
&•••<£> z t e )\ 


If we assume that the latent variables are normally distributed, and indepen- 
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dent between facets, £, ? has a multiplicative structure; that is, 

= E [z a z t a 8 z B Zg (8) • • ■ z £ z|] 

= £a 8) 8 • • • 8 £_e 

It follows that 

S = S a 8Sb8-*8S b + D u . (13) 

Note that the common scores are not normally distributed under this model 
since the product of normally distributed variables is not normally distributed. 

There is an equivalent specification of the model with standardized 
within-facet dispersion matrices. The true score dispersion matrix is stan- 
dardized by pre- and post-multiplying the dispersion matrices by a diagonal 
matrix D r) = diag~ ^ [S, ; that contains the inverse of the true score standard 
deviations. The multiplicative structure of implies that this matrix is found 
to be structured as follows: 

D r; = 8 Db8-"8Db , 

where = diag~^(H F ). Hence, the correlation matrix of the common 
scores, P^, is 

Pr? DjjS^D ^ 

= (DaEaDa) 8 (DbSbDb) 8 • • • 8 (D^S^D^) 

= Pa8P B -8P E • 

The symmetric matrices P / are the within-facet correlation matrices under 
the CDP model. It follows that 

£ = D" 1 (P„ + D„D tt D„) D- 1 = D- 1 (P„ + D^) D,^ 1 . (14) 

The elements of the p x p diagonal matrix D) ; represent ratios of unique 
variance to common score variance and one minus any diagonal element of 
Df equals the classical test theory reliability of the corresponding measure. 
The matrix D~ 1 has a multiplicative structure since 

D^ 1 = (Da 8 D b 8 • • • 8 De)' 1 = D^ 1 8 D B X 8 • • • 8 D^ 1 . 
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The model in (14) has the same form as (10) and may be fitted using 
LISREL with appropriate (non-linear) constraints on P )? . However, LISREL 
requires that each of the individual constraints be specified and this is cum- 
bersome if there are many of them. Alternatively, the CDP model may be 
formulated as a CFA model (see Wothke & Browne, 1989) but this too is 
cumbersome when the number of facets is larger than two. Fortunately, the 
multiplicative model is easily fitted with the Mx program which incorporates 
the Kronecker product as a model operator. The following box gives a general 
scheme for an Mx script: 


TITLE: multiplicative model with any number of facets 

DAta N O bservat i on s=(« r. of observations) NInput= (p) NGroup=l 

CM Fl=(file with sample covariance or correlation matrix ) 

MATrices D Diagonal p p FREE 

A STandardized a a FREE 

B STandardized b b FREE 

(etc.) 

X Diagonal a a FREE 
Y Diagonal b b FREE 
Z Diagonal a a FREE 
(etc.) 

U Diagonal p p FREE 

CO (X@Y@- ■ ■ @Z)*(A@B@C@ ■ ••(§>£ + U.U)*(X@Y@- ■ ■ @Z) / 
STart (starting values) 

Options (here you can specify e.g., the number of iterations) 
end 


When we analyze the correlation matrix we need to specify a model 
for the population correlation matrix. If we standardize the covariance ma- 
trix, this only affects the elements of D^ 1 , which now represent the ratios of 
sample standard deviations to common score standard deviations. 
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An Application of the Composite Direct 
Product Model 

The Miller and Lutz (1966) data consist of the scores of 51 education 
students on a test designed to assess teacher’s judgements about the effect of 
situation and instruction factors on the facilitation of student learning. The 
measures where constructed according to a facet design with three facets with 
two conditions each: 

1. A: Grade level of the student. Ai denotes the first grade and A 2 the 
sixth grade. 

2. B: Teacher approach. B x denotes a teacher-centered approach and 
B 2 a pupil-centered approach. 

3. C: Teaching method. Cj denotes an approach where teaching con- 
sisted mainly of rote learning activities. C 2 denotes an approach in which the 
teacher attempts to develop pupil understanding without much emphasis on 
rote memorization. 

The Miller-Lutz data were analyzed by Wiley, Schmidt, and Bramble (1973), 
and Joreskog (1973) using the additive model, and detailed results can be 
found there. Note that Joreskog (1973, p. 32) used Browne’s parametrization. 
The CC model shows a reasonable fit to these data; x 2 (21) = 37.97, p = 
0.01, and the Root Mean Square Error of Approximation (RMSEA) equals 
0.11 (see Steiger & Lind, 1980 or McDonald, 1989). The results indicate 
that differences between the drill and the discovery methods of instruction 
caused most variation in the responses of the education students. Differences 
in teacher approach showed least variation. 

We have used the covariance matrix reported by Joreskog (1973, Ta- 
ble 10) to estimate the parameters of the CDP model. The model did not fit 
the data as judged from the chi-square statistic (x 2 (19) = 49.4, p < 0.001, 
RMSEA = 0.18) as well as the residuals. For illustration we give some of 
the results here: 



1 


1 

P.4 = 

0.89(0.81 - 0.95) 1 

■Pb = 

0.92(0.84- 0.98) 1 
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and 


P c = 


1 

0.42 ( 0.19 


0 . 61 ) 1 


Within brackets are 95-percent confidence intervals (Neale & Miller, 1997). 
The results confirm the conclusion that, in the view of the coming teachers, 
differences in teaching method are more important than differences on any 
other facet. On the other hand, there is no substantive reason to support the 
CDP model for these data. 


Concluding Remarks 

In this paper we have extended models that were conceived for the anal- 
ysis of MTMM data. We have demonstrated how the models are derived from 
the model for the observations and how they can be fitted using the LISREL 
or the Mx program. Note that Mx can handle all models that have been dis- 
cussed. A minor disadvantage of Mx program is that it uses numerical deriva- 
tives which may make the optimization algorithm less stable, sometimes. 

Models that are similar to the CDP model are described by Swain 
(1975) and Verhees and Wansbeek (1990) and the Mx script described above 
is easily adapted to fit these models. It is possible and indeed not difficult to 
formulate hybrid models combining an additive specification of some facets 
and a multiplicative specification for others. Such models are not difficult to 
fit using Mx. However, unless there is a strong theoretical interest in such 
models, fitting them would merely be an exercise in SEM. This brings us to 
an important point. To wit, although the CDP model has been found useful to 
describe MTMM correlation matrices, it represents a strong hypothesis on the 
data. We find it somewhat disturbing that the vast majority of the applications 
of the CDP model to MTMM matrices that we know provide no substantive 
arguments for use of the model. An exception being, for instance, Bagozzi, Yi 
and Phillips (1991). Even studies where multiplicative and additive models 
are compared (e.g., Hernandez Baeza & Gonzalez Roma, 2002) focus almost 
exclusively on the relative fit of the models. At most, authors (e.g., Cudeck, 
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1988, p. 141) refer to the work by Campbell and O’Connell (1967; 1982) 
who observed that for some MTMM correlation matrices, inter-trait correla- 
tions are attenuated by a multiplicative constant (smaller in magnitude than 
unity), when different methods are used. 

In closing, we mention two topics for future research. First, it is nec- 
essary to establish the identifiability of the (reparameterized) CC and CDP 
model. Although we believe these models to be identifiable there is no general 
proof available that they are identifiable for any number of facets. Second, we 
would like to have ways to perform exploratory analysis on the within-facet 
covariance (or correlation) matrices. A suggestion is to use a model incor- 
porating principal components. Such model have been considered (for two- 
facets) by Flury and Neuenschwander (1995). Dolan, Bechger, and Molenaar 
(1999) suggest how these model can be fitted in a SEM framework. 
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Appendix 


Title Hoeks data CC model on correlations 
Da No=620 Ni=18 
KM SY 
1.000 

0.502 1.0000 


0.512 

0.475 

1.0000 







0.419 

0.313 

0.419 

1.000 







0.457 

0.457 

0.430 

0.385 

1.000 






0.481 

0.401 

0.479 

0.431 

0.405 

1.000 





0.568 

0.519 

0.530 

0.501 

0.553 

0.497 

1.000 




0.456 

0.396 

0.494 

0.512 

0.491 

0.449 

0.637 

1.000 



0.527 

0.470 

0.444 

0.453 

0.490 

0.491 

0.680 

0.675 

1.000 


0.485 

0.433 

0.418 

0.373 

0.415 

0.361 

0.544 

0.425 

0.533 

1.000 

0.426 

0.388 

0.392 

0.384 

0.407 

0.339 

0.445 

0.413 

0.436 

0.441 

1.000 










0.293 

0.282 

0.306 

0.259 

0.262 

0.253 

0.290 

0.222 

0.271 

0.273 

0.243 

1.000 









0.573 

0.508 

0.577 

0.487 

0.482 

0.516 

0.642 

0.555 

0.558 

0.516 

0.404 

0.303 

1.000 








0.516 

0.389 

0.465 

0.390 

0.420 

0.436 

0.517 

0.456 

0.461 

0.434 

0.348 

0.286 

0.582 

1.000 







0.497 

0.398 

0.410 

0.445 

0.452 

0.483 

0.616 

0.572 

0.598 

0.552 

0.401 

0.247 

0.601 

0.494 

1.000 






0.160 

0.101 

0.212 

0.151 

0.093 

0.256 

0.152 

0.196 

0.156 

0.186 

0.117 

0.137 

0.166 

0.119 

0.159 

1.000 





0.206 

0.106 

0.140 

0.162 

0.204 

0.239 

0.131 

0.103 

0.073 

0.170 

0.178 

0.333 

0.215 

0.288 

0.207 

0.106 

1.000 




0.251 

0.168 

0.226 

0.233 

0.250 

0.378 

0.248 

0.229 

0.221 

0.250 

0.169 

0.272 

0.293 

0.267 

0.276 

0.440 

0.290 

1.000 




mo ny=18 ne=18 nk=4 ps=ze,fi ly=di,fr ga=fu,fi ph=sy,fr 
MA gamma 
1 00 0 !00 
1 0 0 0 11 0 
1 00 0 10 1 
1 00 0 100 
1 000 11 0 
1 00 0 10 1 
1 0 1 0 10 0 
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10 10 110 
1 0 1 0 10 1 
1 1 1 0 10 0 
1110 110 
1 1 1 0 10 1 
1 1 0 1 10 0 
110 1110 
1 1 0 1 10 1 
1 1 0 1 10 0 
110 1110 
1 1 0 1 10 1 
pa ph 
0 

0 1 
00 1 
000 1 
10 0 00 1 
10 0 00 1 1 
va 1 ph(l,l) 
st 0.3 all 

ou se rs ULS it=40000 


Note that anything after a 1 is ignored by LISREL. We have kept it here to illustrate 
how the script was changed from the first to the final analysis. 



